Refactor and a couple of fixes for adapter layer updates #1268
Conversation
`AdaLoraLayer` is a subclass of `LoraLayer`, so just checking for `isinstance(target, LoraLayer)` will match `AdaLoraLayer`, which we don't want when it comes to updating a `LoraLayer`. Now, we explicitly check that the layer is *not* an instance of `AdaLoraLayer`.
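Roughly, the check amounts to something like this (a minimal sketch with stand-in classes, not the actual PEFT code):

```python
# Minimal sketch of the isinstance check described above; the classes here
# are stand-ins, not the real PEFT implementations.

class LoraLayer: ...
class AdaLoraLayer(LoraLayer): ...

def is_plain_lora_layer(target) -> bool:
    # isinstance(target, LoraLayer) alone would also match AdaLoraLayer,
    # so AdaLora layers have to be excluded explicitly.
    return isinstance(target, LoraLayer) and not isinstance(target, AdaLoraLayer)

print(is_plain_lora_layer(LoraLayer()))     # True
print(is_plain_lora_layer(AdaLoraLayer()))  # False
```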
Thank you @BenjaminBossan for fixing many bugs while simplifying a lot of code. LGTM! 🔥
Thanks! Can you double-check that the integration tests pass, just in case?
@younesbelkada Integration tests don't work on forks, right? So we can only check after merging.
I just ran it here: https://github.com/huggingface/peft/actions/runs/7246319576
Tests are green! Feel free to merge!
This started as a simple refactor, but in the process I discovered a few issues as well, which should now be fixed. This is probably best reviewed by checking each commit separately.
Here are the descriptions for the individual changes:
1. Refactor: Move LoRA `update_layer` to child classes

For LoRA, so far, we have `update_layer` for `Linear`, `update_layer_embedding` for `Embedding`, and `update_layer_conv2d` for `Conv2d`, all defined on `LoraLayer`.

We can simplify the code by always using the name `update_layer` and by moving the layer-specific methods to the subclasses. So e.g. `update_layer_embedding` is moved to the `Embedding` class and renamed to `update_layer`. This way, the caller does not need to differentiate which type of layer it's calling, which makes the call site much simpler (see the simplification in `LoraModel._create_and_replace`). It also makes the code less error-prone (see change 4 described below).

Interestingly, this was already practiced for IA³, so the same change was not necessary there. But I did find the same method implemented twice, once on `IA3Layer` and once on `Linear`, so I removed one of the duplicates. For all other adapter methods, no change was required, as they only implement `Linear`.

Note: This could technically be backwards incompatible if users do some custom stuff with `LoraLayer`s. I could add a note for the next release that all `update_layer_*` methods have been consolidated under the single `update_layer` name.
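As a rough illustration of the new structure (a simplified sketch with toy classes, not the actual PEFT code):

```python
# Simplified sketch of the refactor: each subclass defines its own
# update_layer, so callers no longer dispatch on the layer type.
# These are toy classes, not the real PEFT implementations.

class LoraLayer:
    """Base class: shared bookkeeping only, no layer-specific update logic."""

    def __init__(self):
        self.r = {}

    def update_layer(self, adapter_name, r):
        raise NotImplementedError


class Linear(LoraLayer):
    def update_layer(self, adapter_name, r):
        self.r[adapter_name] = r
        print(f"Linear: set up rank-{r} LoRA weights for {adapter_name!r}")


class Embedding(LoraLayer):
    # formerly LoraLayer.update_layer_embedding, now simply update_layer
    def update_layer(self, adapter_name, r):
        self.r[adapter_name] = r
        print(f"Embedding: set up rank-{r} LoRA embeddings for {adapter_name!r}")


# The caller can treat all layer types uniformly:
for target in (Linear(), Embedding()):
    target.update_layer("default", r=8)
```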
2. Systematic handling of `r` (rank) <= 0

Always raise an error when `r <= 0`, not only for LoRA. Also, removed the later check for `r > 0` in LoRA layers, since we already check for `r <= 0`.

Note: This could technically also be considered backwards incompatible, but `r <= 0` should not work correctly anyway, so it is better to raise an error right away.
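In essence, the validation amounts to something like this (a hedged sketch; the exact error message in PEFT may differ):

```python
# Sketch of the up-front rank validation. With this check in place, later
# `if r > 0:` branches inside the LoRA layers become redundant.

def validate_rank(r: int) -> None:
    if r <= 0:
        raise ValueError(f"`r` should be a positive integer value but the value passed is {r}")

validate_rank(8)  # fine

try:
    validate_rank(0)
except ValueError as exc:
    print(exc)  # `r` should be a positive integer value but the value passed is 0
```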
3. Fix broken `__repr__` method on `QuantLinear`

It was indented too deep and was thus never applied.
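For illustration, this is the kind of bug meant here (toy classes, not the actual `QuantLinear` code):

```python
# When a __repr__ is indented too deep, it ends up as a local function inside
# another method instead of a method on the class, so it is silently never
# used. Toy example, not the real QuantLinear.

class Broken:
    def forward(self, x):
        return x

        def __repr__(self):  # nested inside forward() -> never attached to the class
            return "lora." + super().__repr__()


class Fixed:
    def forward(self, x):
        return x

    def __repr__(self):  # defined at class level -> actually used
        return "lora." + super().__repr__()


print(repr(Broken()))  # default object repr; the custom __repr__ is ignored
print(repr(Fixed()))   # lora.<__main__.Fixed object at 0x...>
```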
4. Fix bug for updating LoRA GPTQ and IA³ bnb layers

Before this fix, when adding a 2nd adapter to a model, we did not correctly check whether there was already an adapter layer in the model when dealing with LoRA GPTQ or IA³ bnb layers. As a consequence, instead of updating the existing layers, we would create a new layer, and the existing layer would be set as the `base_layer` of that new layer. Now, we correctly update the existing layer to add the new adapter.

For this fix to work correctly with LoRA and GPTQ, I had to add a check for `qweight` in `update_layer`, since we only checked for `weight` before. This was a mistake that didn't surface so far because of the error described in the previous paragraph.

Tests were extended to check this. They fail on the current main but pass on my machine with this PR.
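Schematically, the fixed dispatch behaves like this (a simplified sketch with stand-in classes; GPTQ-quantized linears store their parameters under `qweight` rather than `weight`):

```python
# Simplified sketch of the fix: only wrap the target once, and recognize an
# existing adapter layer even when the wrapped base module exposes `qweight`
# (GPTQ) instead of `weight`. Stand-in classes, not the real PEFT code.

class AdapterLayer:
    def __init__(self, base_layer):
        self.base_layer = base_layer
        self.adapters = []

    def update_layer(self, adapter_name, r):
        self.adapters.append((adapter_name, r))


def add_adapter(target, adapter_name, r):
    base = getattr(target, "base_layer", None)
    has_params = base is not None and (hasattr(base, "weight") or hasattr(base, "qweight"))
    if isinstance(target, AdapterLayer) and has_params:
        target.update_layer(adapter_name, r)  # correct: add to the existing layer
        return target
    new_layer = AdapterLayer(target)          # first time: wrap the base module once
    new_layer.update_layer(adapter_name, r)
    return new_layer


class FakeGPTQLinear:
    qweight = object()  # quantized weights live under `qweight`, not `weight`


layer = add_adapter(FakeGPTQLinear(), "adapter_1", r=8)
layer = add_adapter(layer, "adapter_2", r=8)  # updates in place, no double wrapping
print(layer.adapters)  # [('adapter_1', 8), ('adapter_2', 8)]
```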